Picture for Bing Hu

Bing Hu

When Think-with-Image Meets Safety: What Determines Multimodal Jailbreak Robustness?

Add code
May 27, 2026
Viaarxiv icon

Seizure-Semiology-Suite (S3): A Clinically Multimodal Dataset, Benchmark, and Models for Seizure Semiology Understanding

Add code
May 21, 2026
Viaarxiv icon

From Abstraction to Instantiation: Learning Behavioral Representation for Vision-Language-Action Model

Add code
May 21, 2026
Viaarxiv icon

Chain of Risk: Safety Failures in Large Reasoning Models and Mitigation via Adaptive Multi-Principle Steering

Add code
May 07, 2026
Viaarxiv icon

Structure-to-Image: Zero-Shot Depth Estimation in Colonoscopy via High-Fidelity Sim-to-Real Adaptation

Add code
Feb 25, 2026
Viaarxiv icon

Global Prior Meets Local Consistency: Dual-Memory Augmented Vision-Language-Action Model for Efficient Robotic Manipulation

Add code
Feb 22, 2026
Viaarxiv icon

SynQP: A Framework and Metrics for Evaluating the Quality and Privacy Risk of Synthetic Data

Add code
Jan 17, 2026
Viaarxiv icon

LION-FS: Fast & Slow Video-Language Thinker as Online Video Assistant

Add code
Mar 05, 2025
Figure 1 for LION-FS: Fast & Slow Video-Language Thinker as Online Video Assistant
Figure 2 for LION-FS: Fast & Slow Video-Language Thinker as Online Video Assistant
Figure 3 for LION-FS: Fast & Slow Video-Language Thinker as Online Video Assistant
Figure 4 for LION-FS: Fast & Slow Video-Language Thinker as Online Video Assistant
Viaarxiv icon

Drug Discovery SMILES-to-Pharmacokinetics Diffusion Models with Deep Molecular Understanding

Add code
Aug 14, 2024
Viaarxiv icon

Bug In the Code Stack: Can LLMs Find Bugs in Large Python Code Stacks

Add code
Jun 21, 2024
Viaarxiv icon